Serial No. 09/992,987 
AMENDMENTS TO THE CLAIMS 



1 . (currently amended) A method for maintaining records in a database 
comprising: 

(a) receiving at least a collection of first data items and a collection of second 
data items[[,]] at l e ast on e of th e first data it e ms b e ing associat e d with on e of th e socond 
data it e ms ; 

(b) disposing the first data items in a plurality of fields arranged in a 
predetermined format to form a first assemblage; 

(c) disposing the second data items in a plurality of fields arranged in the 
predetermined format to form a second assemblage; 

(d) modifying selected ones of the first data items and the second data items 
to conform to predetermined nomenclature; 

(e) maintaining a record, in the database, having a plurality of fields arranged 
in the predetermined format; and 

(f) determining whether a first particular data item in the predetermined 
nomenclature in a selected field of the first assemblage is identical to a second particular 
data item in the predetermined nomenclature in a field of the second assemblage 
corresponding to the selected field: and 

ffi(g) if it is determined that the first particular data item in the predetermined 
nomenclature is identical to the second particular data item in the predetermined 
nomenclature, including in a field in the record a selected one of (1) the first data items in 
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the predetermined nomenclature and (2) the second data items in the predetermined 
nomenclature. 

2. (original) The method of Claim 1 wherein at (b) and (c) the first data 
items and the second data items are separated, rearranged, or combined to form the first 
assemblage and the second assemblage respectively. 

3. (currently amended) The method of Claim 1 wherein at (f)£g) the first 
data items and the second data items are assigned accuracy rankings and are selected 
based upon their accuracy rankings. 

4. (currently amended) A system for maintaining records in a database 
comprising: 

a communications interface for receiving at least a collection of first data items 
and a collection of second data items[[,]] at l e ast on e of th e first data it e ms being 
associat e d with on e of th e s e cond data it e ms ; 

a converter for disposing the first data items in a plurality of fields arranged in a 
predetermined format to form a first assemblage, and for disposing the second data items 
in a plurality of fields arranged in the predetermined format to form a second 
assemblage; 

a normalizer for modifying selected ones of the first data items and the second 

data items to conform to predetermined nomenclature; 
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a database for maintaining a record, in the database, having a plurality of fields 
arranged in the predetermined format; and 

a data masher for determining whether a first particular data item in the 
predetermined nomenclature in a selected field of the first assemblage is identical to a 
second particular data item in the predetermined nomenclature in a field of the second 
assemblage corresponding to the selected field, and for including in a field in the record a 
selected one of (1) the first data items in the predetermined nomenclature and (2) the 
second data items in the predetermined nomenclature , if it is determined that the first 
particular data item in the predetermined nomenclature is identical to the second 
particular data item in the predetermined nomenclature . 

5. (currently amended) The system of Claim 4 wherein each of the of th e 
first assemblage of data items and the second assemblage of data items in the 
predetermined nomenclature are associated with data accuracy rankings. 

6. (currently amended) The system of Claim 5 wherein the data masher 
s e l e ct e d selects the first data items and the second data items based upon the data 
accuracy rankings. 
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7. (currently amended) A system for maintaining records in a database 
comprising: 

a converter for converting at least a first collection of fkst data items and a second 
collection of s e cond data items to at least a plurality of fi e lds arrang e d in a prcdot e rmin e d 
format to form a first assemblage of data items and a second assemblage , respectivelv. 
the first and second assemblages each having data items in fields which are arranged in a 
predetermined format of data it e ms, at least on e of th e first data it e ms b e ing asGociatod 
with on e of th e s e cond data it e ms ; 

a normalizer for converting selected data items ems of the first assemblage ef 
data it e ms and the second assemblage of data items to conform to a predetermined 
nomenclature; 

a database for maintaining a record[[,]] in th e databasc [[,]] having a plurality of 
fields arranged in the predetermined format; and 

a data masher for determining a value representing a number of corresponding 
fields in the first assemblage and the second assemblage having identical data items 
therein, the data masher based on the value selecting at least one of the data items in the 
first assemblage and the second assemblage to form the record s e l e cting at l e ast on e data 
it e m from th e first ass e mblag e of data it e ms in th e pr e d e termin e d nom e nclatur e , selecting 
data it e ms from th e s e cond ass e mblag e of data it e m s in th e pr e d e t e rmin e d nom e nclature, 
and combining th e at l e ast on e data it e ms and th e data it e ms from th e s e cond ass e mblage 
to cr e at e th e r e cord . 



Serial No, 09/992,987 

8. (original) The system of Claim 7 wherein the converter separates, 
rearranges, or combines data to form the first assemblage of data items and the second 
assemblage of data items. 

9. (original) The system of Claim 7 wherein each of the of the first 
assemblage of data items and the second assemblage of data items in the predetermined 
nomenclature are associated with data accuracy rankings. 

10. (currently amended) The system of Claim 9 wherein the data masher 
s e l e cted selects data items in the first assemblage and the second assemblage th e first data 
it e ms and th e second data it e ms based upon the data accuracy ratings. 

1 1 . (currently amended) A system for maintaining records in a database 
comprising: 

a converter for converting first records containing data items to uniform records 
having the data items organized in a uniform format, a coll e ction of first r e cords having 
fi e lds and data it e ms into uniform r e cords having fi e lds and data it e ms organiz e d into a 
uniform format and wh e r e e ach of wherein the uniform records have one or more of the 
data items conforming to a predetermined nomenclature and a balance of the one or more 
of the data items not conforming to the predetermined nomenclature; 

a normalizer for converting the balance of the one or more data items to the 

predetermined nomenclature, and producing a collection of second records each having 
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selected data items organized in fields according to the uniform format, the selected data 
items conforming to the on e or mor e of th e data it e ms that conform to a pr e d e t e rmined 
nom e nclatur e and th e balanc e of th e on e or mor e data it e ms that ar e in the predetermined 
nomenclature; and 

a data masher for selectin g, from among the second records, at least a first 
selected record and a second selected record based on a value representing a number of 
corresponding fields in the first selected record and the second selected record having 
identical data items therein, and for selecting data items from the first selected record and 
second selected record to form a third data record, the third data record being e quivalent 
r e cords from th e coll e ction of s e cond r e cords and s e l e cting data it e ms from the 
equival e nt r e cords for combining into a third record having th e s e l e ct e d data items 
arrang e d in th e uniform format and stored in the database. 

12. (original) The system of Claim 1 1 wherein the converter separates, 
rearranges, or combines the fields and the data items, from the first records, into the 
uniform format. 

13. (original) The system of Claim 1 1 wherein before the normalizer 
converts the balance of the one or more data items it separates the one or more data items 
into components. 
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14. (currently amended) The system of Claim 1 1 wherein the data masher 
s e l e cts e quival e nt r e cords by comparing th e data it e ms in th e s e cond r e cords on a fi e ld 
by fi e ld basis and d e t e rmin e s a confid e nc e l e v e l valu e of th e s e cond r e cords bas e d upon 
th e similariti e s of th e data items in th e s e cond r e cords, and d e t e rmining determines that 
the s e cond r e cords first selected record and the second selected record are equivalent if 
the confid e nc e l e v e l valu e s ar e value is above a predetermined threshold level value. 

15. (currently amended) Soflware recordable in a tangible medium which 
includes machine readable instructions for performing a process for building a database, 
which stores records corresponding to a plurality of data items, the process comprising: 

(a) receiving at least a collection of first data items and a collection of second 
data items[[,]] at l e ast on e of th e first data it e ms b e ing associat e d with one of th e s e cond 
data it e ms ; 

(b) disposing the first data items in a plurality of fields arranged in a 
predetermined format to form a first assemblage; 

(c) disposing the second data items in a plurality of fields arranged in the 
predetermined format to form a second assemblage; 

(d) modifying selected ones of the first data items and the second data items 
to conform to predetermined nomenclature; 

(e) maintaining a record, in the database, having a plurality of fields arranged 
in the predetermined format; bb4 
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(f) determining whether a first particular data item in the predetermined 
nomenclature in a selected field of the first assemblage is identical to a second particular 
data item in the predetermined nomenclature in a field of the second assemblage 
corresponding to the selected field; and 

(f^ (g) if it is determined that the first particular data item in the predetermined 
nomenclature is identical to the second particular data item in the predetermined 
nomenclature, including in a field in the record a selected one of (1) the first data items in 
the predetermined nomenclature and (2) the second data items in the predetermined 
nomenclature. 

16. (original) The process of Claim 15 wherein at (b) and (c) the first data 
items and the second data items are separated, rearranged, or combined to form the first 
assemblage and the second assemblage respectively. 

1 7. (currently amended) The process of Claim 15 wherein at ffl(g) the first 
data items and the second data items are assigned accuracy rankings and are selected 
based upon their accuracy rankings. 

1 8. (currently amended) A method for maintaining records in a database 
comprising: 

converting first records containing data items to uniform records having the data 

items organized in a uniform format, a coll e ction of first r e cords having fi e lds and data 
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it e ms into uniform r e cordG having fields and data it e ms and wh e re oach of wherein the 
uniform records have one or more of the data items conforming to a predetermined 
nomenclature and a balance of the one or more data items not conforming to the 
predetermined nomenclature; 

converting the balance of the one or more data items to the predetermined 
nomenclature[[J]; 

producing a collection of second records each having selected data items 
organized in fields according to the uniform format, the selected data items conforming 
to the the on e or mor e of th e data items that conform to a pr e d e t e rmined nom e nclature 
and th e balanc e of th e on e or mor e data it e ms that ar e in the predetermined nomenclature; 
and 

selectin g, from among the second records, at least a first selected record and a 
second selected record based on a value representing a number of corresponding fields in 
the first selected record and the second selected record having identical data items 
therein: and 

selecting data items fi-om the first selected record and second selected record to 
form a third data record, the third data record being e quival e nt r e cords fi-om the 
coll e ction of s e cond r e cords and s e lecting th e data it e ms from th e e quival e nt records for 
combining into a third r e cord having th e s e l e ct e d data it e ms and stored in the database. 

19. (original) The method of Claim 1 8 wherein the fields and the data 

items, fi-om the first records, are separated, rearranged, or combined. 
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20. (original) The method of Claim 1 8 wherein the balance of the one or 
more data items are separated into components and converted into the predetermined 
nomenclature. 

2 1 . (currently amended) The method of Claim 1 8 wherein e quival e nt records 
ar e s e l e ct e d by comparing th e data it e ms in the s e cond r e cords on a fi e ld by fi e ld basis 
and d e t e rmining a confid e nc e l e v e l valu e of th e s e cond r e cords bas e d upon th e 
similariti e s of th e data it e ms in th e s e cond r e cords, and d e t e rmining that the first selected 
record and the second selected record second r e cords are determined to be equivalent if 
the confid e nc e l e v e l valu e s arc value is above a predetermined threshold level value. 



